ASAR: Application-Specific Approximate Recovery to Mitigate Hardware Variability
نویسندگان
چکیده
Technology scaling in microelectronics has reached limits that are resulting in increasing variation in component design and performance characteristics. Chips and systems comprising of such components are starting to exhibit a rise in process-induced failures and soft errors. Conventional design time solutions such as conservative guardbands to hide such variations are increasingly not viable for cost and performance reasons. As an alternative, researchers have sought to expose hardware fault information to the software stack and enable a programmer to use the fault information during software development. In this work, we propose the use of Software Recovery Blocks (SRB) as a programming construct that enables a programmer to provide application-specific error recovery code. Recovery comes in two modes: a) rerunning or b) discarding the erroneous computation. While rerunning comes at a performance overhead, discarding erroneous computations could result in degraded output quality, giving a user two extreme operating points on the performance-quality trade-off curve. In order to exploit intermediate performance-quality trade-off points, this work proposes approximate recovery which is particularly beneficial to approximate-computing applications. Such applications offer a natural tolerance to errors and the work introduces a SRB extension called Application-Specific Approximate Recovery (ASAR). ASAR provides 3.8%–29.9% speedup relative to rerun for six approximate-computing applications. Furthermore, the work proposes a hybrid recovery mechanism which allows a user to set desired output quality and exploit the performance-quality tradeoff curve at a finer-granularity. Hybrid recovery uses a mixture of ASAR and rerun-based recovery to demonstrate 1.5%–11.6% speedup compared to only rerun, while maintaining user-specified output quality.
منابع مشابه
Impact of Soil Moisture Dynamics on ASAR σo Signatures and Its Spatial Variability Observed over the Tibetan Plateau
This paper reports on the analysis of a 2.5 year-long time series of ASAR wide swath mode (WSM) observations for characterizing the soil moisture dynamics. The employed ASAR WSM data set consists of 152 VV-polarized scenes acquired in the period between April 2005 and September 2007 over the Naqu river basin located on the Tibetan Plateau. For four different spatial domains, with areas of 30x30...
متن کاملMitigating the Impact of Hardware Failures on Multimedia Applications – A Cross-Layer Approach
Increasing exponentially with each technology generation, hardwareinduced soft errors pose a significant threat for the reliability of mobile multimedia devices. Since traditional hardware error protection techniques incur significant power and performance overheads, this paper proposes a cross-layer cooperative approach that exploits existing error control schemes at the application layer to m...
متن کاملThe Design, Modeling, and Evaluation of the Relax Architectural Framework
As transistor technology scales ever further, hardware reliability is becoming harder to manage. The effects of soft errors, variability, wear-out, and yield are intensifying to the point where it becomes difficult to harness the benefits of deeper scaling without mechanisms for hardware fault detection and correction. We observe that the combination of emerging applications and emerging many-c...
متن کاملSensitivity of Asar Ap Data to Wheat Crop Parameters
This paper illustrates some results obtained in the frame of an experimental campaign carried out on several wheat fields of the Matera (Italy) site, from February 2003 to June 2004. During this period, ENVISAT ASAR data, at alternating polarization mode and at different incidence angles, were acquired on the area roughly every week. Ground data such as soil moisture content, wheat biomass, LAI...
متن کاملVerifying Quantitative Reliability of Programs That Execute on Unreliable Hardware
Emerging high-performance architectures are anticipated to contain unreliable components that may exhibit soft errors, which silently corrupt the results of computations. Full detection and recovery from soft errors is challenging, expensive, and, for some applications, unnecessary. For example, approximate computing applications (such as multimedia processing, machine learning, and big data an...
متن کامل